A Better Approach for Horizontal Aggregations in SQL Using Data Sets for Data Mining Analysis

Authors

  • Y. Chakravarthi
  • P. Vindhya
Abstract

To analyze data efficiently, data mining systems widely use data sets with columns in a horizontal tabular layout. Preparing a data set is generally the most complex task in a data mining project; it requires many complex SQL queries that aggregate columns and join tables. Conventional RDBMSs usually manage tables in vertical form. An aggregation in a horizontal tabular layout returns a set of numbers per row, instead of one number per row. This new class of functions is called horizontal aggregations. The system uses one parent table and several child tables; operations are then performed on the data loaded from these multiple tables. We propose three fundamental methods: SPJ (select-project-join-aggregation), CASE, and PIVOT. SPJ is based on standard relational algebra operators. CASE exploits the programming CASE construct. PIVOT is a built-in operator offered by a commercial DBMS and is used to calculate aggregate operations. The PIVOT method is much faster and scales much better. The large data set obtained as the result of horizontal aggregation is then partitioned.

Key Terms: Aggregation; Data Mining; Data preparation; Structured Query Language (SQL); Pivot

Full Text: http://www.ijcsmc.com/docs/papers/August2013/V2I8201353.pdf
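As a rough illustration of the difference between a vertical and a horizontal aggregation, the sketch below builds one small table and computes the same horizontal result with the CASE method and with the SPJ method. The `sales` table, its columns, and its values are hypothetical and are not taken from the paper. SQLite is used only so the example runs anywhere; SQLite has no PIVOT operator, so that method is not shown.

```python
import sqlite3

# Hypothetical fact table: one row per sale (not from the paper).
conn = sqlite3.connect(":memory:")
conn.execute("CREATE TABLE sales (store TEXT, quarter TEXT, amount REAL)")
conn.executemany(
    "INSERT INTO sales VALUES (?, ?, ?)",
    [("A", "Q1", 10.0), ("A", "Q1", 5.0), ("A", "Q2", 7.0),
     ("B", "Q1", 3.0), ("B", "Q2", 4.0), ("B", "Q2", 6.0)],
)

# Vertical aggregation: one number per (store, quarter) row.
vertical = conn.execute("""
    SELECT store, quarter, SUM(amount)
    FROM sales
    GROUP BY store, quarter
    ORDER BY store, quarter
""").fetchall()

# CASE method: one row per store, one aggregated column per quarter.
case_rows = conn.execute("""
    SELECT store,
           SUM(CASE WHEN quarter = 'Q1' THEN amount END) AS q1,
           SUM(CASE WHEN quarter = 'Q2' THEN amount END) AS q2
    FROM sales
    GROUP BY store
    ORDER BY store
""").fetchall()

# SPJ method: aggregate each quarter separately, then left-join the
# partial results to the list of distinct stores.
spj_rows = conn.execute("""
    SELECT k.store, f1.total AS q1, f2.total AS q2
    FROM (SELECT DISTINCT store FROM sales) AS k
    LEFT JOIN (SELECT store, SUM(amount) AS total
               FROM sales WHERE quarter = 'Q1' GROUP BY store) AS f1
           ON f1.store = k.store
    LEFT JOIN (SELECT store, SUM(amount) AS total
               FROM sales WHERE quarter = 'Q2' GROUP BY store) AS f2
           ON f2.store = k.store
    ORDER BY k.store
""").fetchall()

print(vertical)
print(case_rows)
print(spj_rows)
```

Both horizontal queries return one row per store with the quarterly totals side by side, which is the layout most data mining tools expect. In this sketch the quarter values are hard-coded; a real implementation would first query the distinct values and generate the column list from them.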

Similar resources

Prepare and Optimize Data Sets for Data Mining Analysis

Preparing a data set for analysis is usually the tedious task in a data mining project, requiring many complex SQL queries, joining tables and aggregating columns. Existing SQL aggregations have limitations for preparing data sets because they return one column per aggregated group. As a rule, a significant manual effort is required to build data sets, where a horizontal la...

Horizontal Aggregations in SQL to Prepare Data Sets for Data Mining Analysis

Data mining is a widely used domain for extracting trends or patterns from historical data. However, the databases used by enterprises can’t be directly used for data mining. This means that data sets must be prepared from real-world databases to make them suitable for particular data mining operations. However, preparing data sets for analyzing data is a tedious task as it involves many aggregat...

Anomaly Detection and SQL Prepare Data Sets for Data Mining

Anomaly detection has been an important research topic in data mining and machine learning. Many real-world applications such as intrusion or credit card fraud detection require an effective and efficient framework to identify deviated data instances. However, most anomaly detection methods are typically implemented in batch mode, and thus cannot be easily extended to large-scale problems witho...

K Means Clustering Algorithm for Partitioning Data Sets Evaluated From Horizontal Aggregations

Data mining refers to the process of analyzing data from different perspectives and summarizing it into useful information, which different users mostly use for analyzing the data as well as for preparing data sets. A data set is a collection of data presented in tabular form. Preparing a data set involves complex SQL queries, joining tables and aggregate functions. Tradition...

Preparing Data Sets for the Data Mining Analysis using the Most Efficient Horizontal Aggregation Method in SQL

A huge amount of time is needed to build the data set for data mining analysis, because data mining practitioners are required to write complex SQL queries and many tables must be joined to get the aggregated result. Traditional SQL aggregations prepare data sets in a vertical layout; that is, they return one column per aggregated group. But for the data mining project, the dat...

Publication date: 2013